AITopics | Rohnert Park

Collaborating Authors

Rohnert Park

Can OpenAI o1 outperform humans in higher-order cognitive thinking?

Latif, Ehsan, Zhou, Yifan, Guo, Shuchen, Shi, Lehong, Gao, Yizhu, Nyaaba, Matthew, Bewerdorff, Arne, Yang, Xiantong, Zhai, Xiaoming

arXiv.org Artificial IntelligenceDec-7-2024

This study evaluates the performance of OpenAI's o1-preview model in higher-order cognitive domains, including critical thinking, systematic thinking, computational thinking, data literacy, creative thinking, logical reasoning, and scientific reasoning. Using established benchmarks, we compared the o1-preview models's performance to human participants from diverse educational levels. o1-preview achieved a mean score of 24.33 on the Ennis-Weir Critical Thinking Essay Test (EWCTET), surpassing undergraduate (13.8) and postgraduate (18.39) participants (z = 1.60 and 0.90, respectively). In systematic thinking, it scored 46.1, SD = 4.12 on the Lake Urmia Vignette, significantly outperforming the human mean (20.08, SD = 8.13, z = 3.20). For data literacy, o1-preview scored 8.60, SD = 0.70 on Merk et al.'s "Use Data" dimension, compared to the human post-test mean of 4.17, SD = 2.02 (z = 2.19). On creative thinking tasks, the model achieved originality scores of 2.98, SD = 0.73, higher than the human mean of 1.74 (z = 0.71). In logical reasoning (LogiQA), it outperformed humans with average 90%, SD = 10% accuracy versus 86%, SD = 6.5% (z = 0.62). For scientific reasoning, it achieved near-perfect performance (mean = 0.99, SD = 0.12) on the TOSLS,, exceeding the highest human scores of 0.85, SD = 0.13 (z = 1.78). While o1-preview excelled in structured tasks, it showed limitations in problem-solving and adaptive reasoning. These results demonstrate the potential of AI to complement education in structured assessments but highlight the need for ethical oversight and refinement for broader applications.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2412.05753

Country:

North America > United States > Georgia > Clarke County > Athens (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)
(5 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Education > Educational Setting > Higher Education (1.00)
Education > Curriculum > Subject-Specific Education (1.00)
Health & Medicine (0.93)
Education > Assessment & Standards (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.72)

Add feedback

AI sports betting platforms' breaches likely impacting March Madness wagers

FOX NewsApr-5-2024, 06:00:50 GMT

Fox News Flash top sports headlines are here. Check out what's clicking on Foxnews.com. College basketball fans from across the country spent the past couple of weeks keeping a close eye on the NCAA Division I men's and women's basketball tournaments. Millions of sports enthusiasts filled out and submitted brackets with hopes their particular games' predictions would become true. The annual basketball tournament seemingly always sparks a noticeable amount of excitement across the sports world, but it also attracts the casual fan and those who might not normally participate in sports gambling.

fox new digital, platform, sports, (13 more...)

FOX News

Country:

Asia > China (0.06)
North America > United States > Connecticut > Tolland County > Storrs (0.05)
North America > United States > California > Sonoma County > Rohnert Park (0.05)

Industry: Leisure & Entertainment > Sports > Basketball (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

Visual Response to Emotional State of User Interaction

Marhamati, Nina, Creston, Sena Clara

arXiv.org Artificial IntelligenceMar-27-2023

This work proposes an interactive art installation "Mood spRing" designed to reflect the mood of the environment through interpretation of language and tone. Mood spRing consists of an AI program that controls an immersive 3D animation of the seasons. If the AI program perceives the language and tone of the users as pleasant, the animation progresses through idealized renditions of seasons. Otherwise, it slips into unpleasant weather and natural disasters of the season. To interpret the language and tone of the user interaction, hybrid state-of-the-art emotion detection methods are applied to the user audio and text inputs. The emotional states detected separately from tone and language are fused by a novel approach that aims at minimizing the possible model disparity across diverse demographic groups.

arxiv preprint arxiv, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2303.17608

Country:

North America > United States > California > Sonoma County > Rohnert Park (0.05)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)

Genre: Research Report (0.70)

Industry: Information Technology (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.89)
Information Technology > Artificial Intelligence > Natural Language (0.69)
Information Technology > Human Computer Interaction > Interfaces (0.68)
(2 more...)

Add feedback

ChatGPT: The End of Online Exam Integrity?

Susnjak, Teo

arXiv.org Artificial IntelligenceDec-19-2022

This study evaluated the ability of ChatGPT, a recently developed artificial intelligence (AI) agent, to perform high-level cognitive tasks and produce text that is indistinguishable from human-generated text. This capacity raises concerns about the potential use of ChatGPT as a tool for academic misconduct in online exams. The study found that ChatGPT is capable of exhibiting critical thinking skills and generating highly realistic text with minimal input, making it a potential threat to the integrity of online exams, particularly in tertiary education settings where such exams are becoming more prevalent. Returning to invigilated and oral exams could form part of the solution, while using advanced proctoring techniques and AI-text output detectors may be effective in addressing this issue, they are not likely to be foolproof solutions. Further research is needed to fully understand the implications of large language models like ChatGPT and to devise strategies for combating the risk of cheating using these tools. It is crucial for educators and institutions to be aware of the possibility of ChatGPT being used for cheating and to investigate measures to address it in order to maintain the fairness and validity of online exams for all students.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2212.09292

Country:

Asia > India (0.05)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > Ohio (0.04)
North America > United States > California > Sonoma County > Rohnert Park (0.04)

Genre: Research Report > New Finding (0.89)

Industry:

Education > Educational Setting > Online (1.00)
Information Technology > Security & Privacy (0.93)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback